Edge Container for Speech Recognition
Abstract
Containerization has mainly been used in pure software solutions, but it is gradually finding its way into industrial systems. This paper introduces an edge container with artificial intelligence for speech recognition, which performs the voice control function of an actuator as a part of the Human Machine Interface (HMI). The work proposes a procedure for creating voice-controlled applications with modern hardware and software resources. The created architecture integrates well-known digital technologies such as containerization, cloud, edge computing, and a commercial voice processing tool. The methodology and architecture enable the actual speech recognition and voice control on the edge device in the local network, rather than in the cloud like the majority of recent solutions. The Linux containers are designed to run without any additional configuration and setup by the end user. A simple adaptation of voice commands via a configuration file may be considered an additional contribution of the work. The architecture was verified by experiments with containers running on different devices, such as a PC, Tinker Board 2, and Raspberry Pi 3 and 4. The proposed solution and the practical experiment show how a voice-controlled system can be created, easily managed, and distributed to many devices around the world in a few seconds. All this can be achieved by downloading and running two types of ready-made containers without any complex installations. The result of this work is a proven stable (network-independent) solution with data protection and low latency.
Similar resources
Improving the performance of MFCC for Persian robust speech recognition
The Mel-frequency cepstral coefficients (MFCCs) are the most widely used features in speech recognition, but they are very sensitive to noise. In this paper, to achieve satisfactory performance in Automatic Speech Recognition (ASR) applications, we introduce a new noise-robust set of MFCC vectors estimated through the following steps. First, spectral mean normalization is a pre-processing which applies to t...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from the speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
Improved Bayesian Training for Context-Dependent Modeling in Continuous Persian Speech Recognition
Context-dependent modeling is a widely used technique for better phone modeling in continuous speech recognition. While different types of context-dependent models have been used, triphones have been known as the most effective ones. In this paper, a Maximum a Posteriori (MAP) estimation approach has been used to estimate the parameters of the untied triphone model set used in data-driven clust...
"Eigenlips" for Robust Speech Recognition
In this study we improve the performance of a hybrid connectionist speech recognition system by incorporating visual information about the corresponding lip movements. Specifically, we investigate the benefits of adding visual features in the presence of additive noise and crosstalk (cocktail party effect). Our study extends previous experiments by using a new visual front end, and an alternative ...
Journal
Journal title: Electronics
Year: 2021
ISSN: 2079-9292
DOI: https://doi.org/10.3390/electronics10192420